Connection  Machine"  Family:  hirallcl  Computers  thai  are  K&sy  to  Program. 


Thinking  Machines  Corporation  produces 
a  family  of  higli  performance  computer 
systems.  The  largest  member  of  ihe  fam- 
ily is  the  fi  l.noo processor  CM-j!  with 
performance  in  excess  of  Zr>00  Mips,  and 
floating  point  performance  above  2\W 
MKIops.  The  systems  are  programmed 
through  familiar  programming  environ- 
ments, like  I -NIX,  using  parallel  exten- 
sions of  conventional  languages,  like 
Fortran.  Lisp.  andC.  Connection  Machine 
systems  are  currently  being  used  in  a 
range  of  applications  including  data  base 
retrieval,  image  processing,  computer 
aided  design  and  floating-point  intensive 
scientific  calculations. 


The  Connection  Machine  system  is 
easier  to  program  than  other  parallel  com- 
puters because  it  supports  a  data  parallel 
style  of  computation.  This  style  allows 
users  to  express  a  parallel  compulation  in 
an  :itiral  way.  operating  on  a  II  Ihedalaat 
once,  without  worrying  about  issues  like 
partitioningand  synchronization.  The 
machine's  general  commuiiical  ions  net- 
work automatically  adapts  lo  Ihe  structure 
of  die  application. 

The  Conned  ion  Machine  coi  npi  iter 
contains  (il.ooo  processors  connected  by 
a  general  communication  network.  The 
air-cooled  system  presents  no  special 
environmental  requirements. 

The  I  )ataVaull '  mass  storage  system 
provides  lotibylesof  fault  tolerant  mass 
storage  connected  to  one  of  the  Connec- 
tion Machine  high  bandwidth  I.  ( )  channels. 

The  high  resolution  graphics  display 
loads  from  Connection  Machine  memory 
at  1  (ibit  per  second. 


Data  Parallel  Computing: 


Looking  at  the  whole  problem  at  once 
means  computing  on  every  element  of  a 
data  structure  at  the  same  time.  The  Con- 
nection Machine  system's  tens  of  thou- 
sands of  processors  allow  it  to  do  this  in  a 
straightforward  way:  by  attaching  a  sepa- 
rate processor  to  each  element  of  a  data 
structure. 

The  type  of  data  structure  depends  on 
the  application.  In  many  language  process- 
ing applications,  data  parallelism  means  a 
processor  for  every  word  or  every  meaning. 
For  data  base  applications,  it  means  a  pro- 
cessor for  every  document.  In  a  numeric 
simulation,  it  may  mean  a  processor  for 
every  element  of  amatrix.  And  in  image 
processing,  data  parallelism  means  a  pro- 
cessor for  every  picture  element,  or  pixel, 
in  an  image. 

The  performance  advantages  of  data 
parallelism  are  dramatic.  The  parallelism 
inherent  in  the  data  structures  grows  with 
the  size  of  the  data.  Processing  all  the  pix- 
els in  an  image,  for  example,  is  as  fast  as 
processing  one  pixel,  because  they  can  all 
be  computed  at  the  same  time.  A  whole 
data  base  can  be  searched  in  the  time  it 
takes  to  search  one  document.  When  com- 
puting the  flow  of  air  over  an  airplane  wing, 
the  Connection  Machine  system  calcu- 
lates the  flow  over  all  parts  of  the  wing 


simultaneously,  just  as  it  works  in  reality. 
Speeriups  of  1000  or  more  are  common 
with  data  parallelism. 

Il  data  parallelism  is  so  simple  and  pow- 
erful, why  don't  all  big  computers  work 
this  way?  Because  in  order  to  make  these 
programs  work,  t  ens  of  t  Ik  uisaiK  Is  <  if  pn  >- 
cessors  have  to  work  together.  They  have 
to  communicate.  The  Connection  Machine 
system  solves  the  communications  prob- 
lem in  hardware  so  the  natural  algorithms 
work  well. 

What  makes  the  Conned  ion  Machine 
system  work  are  the  connections.  Inside 
the  machine  there  is  a  very  flexible  high- 
bandwidth  communication  network  that 
moves  data  between  processors  at  billions 
of  bits  per  second.  Homing  circuits  on 
every  chip  automatically  steer  data  along 
the  fastest  paths,  helping  to  make  pro- 
gramming simpler.  There  is  no  need  to 
adapt  your  application  to  the  structure  of  a 
fixed  architecture  like  a  grid,  ring,  hyper- 
cube,  or  tree.  Instead,  the  Connection 
Machine  system  adapts  to  yourapplicii- 
tion,  by  dynamically  forming  the  connec- 
tions that  are  needed. 


CM-2  Specifications 


Typical  Application  Performance* 

(fixed  point) 

General  Computing               2500  Mips 
Terrain  Mapping                  1000  Mips 
Document  Search                6000  Mips 

General  SnpHfiratinn« 

Processors  65,536 
Memory                         512  Mbytes 
Memory  Bandwidth               300  Gbits 

per  second 

Interprocessor  Communication 

(32-bit  messages) 

RegularPattern      250  million  per  second 

Random  PattPTTl           8fl  million  npreerrmrl 

Sort  65, 536  32-bit  keys       30  milliseconds 

Input/Output  Channels 

Number  of  Channels  8 
Capacity  per  Channel            40  Mbytes 

per  second 

Maximum  Transfer  Rate        320  Mbytes 

per second 

Variable  Precision  Fixed  Point 

64-bit  integer  add                  1500  Mips 
32-bit  integer  add                  2500  Mips 
16-bit  integer  add                 3300  Mips 
8-bit  integer  add                  4000  Mips 
64-bit  move                        2000  Mips 
32-bit-move                       3000  Mips 
16-bit  move                        3800  Mips 
8-bit  move                         4500  Mips 

Physical  Dimensions 

OI/^e                               bb  xoo  x62 
Weight                            2,600  lbs. 

Environmental  Requirements 
(does  not  include  host) 

Power  Dissipation                    28  KW 
Power  Input  Four30-amp3-phase 

110/208v 

Operating 

Temperature                  70°F  ±  5°F 
Operating  Relative 
Humidity                    50%  ±  10% 

Double  Precision  Floating  Point 

Average  (4K  x  4K  matrix 

multiply)  2500MFlops 
Dot  product  5000MFlops 

Single  Precision  Floating  Point 

Average  (4K  x  4K  matrix 

multiply)  3500MFlops 
Dot  product  10,000MFlops 

Thinking  Machines  Corporation  believes  all  specifications 
are  accurate  as  of  the  date  of  publication.  Thinking  Machines 
Corporation  cannot,  however,  beresponsibleforinadvertent 
errors.  Product  specifications  are  subject  to  change  without 
notice.  For  further  detail,  see  the  Product  Specification  Sheet. 
Mips  =  Millions  of  instructions  per  second;  MFlops  = 
Millions  of  floating  point  operations  per  second. 

System  Components 


Pmjfnimminjj ■Environment: 

TIk>  user  inl  cracl  s  wit  h  1  Ik-  Conned  u  n  i 

Machine  system  through  a  conditional 
front  end  system,  smii ;is ;i  1  )ijrit;il  VAX 
or;i  Symbolics       The  system  is  pro- 
grammed vi;i  the  front  end.  usinu  familiar 
editors and  utilities.  File  structures  and 
network  protocols  are  supported  I  here  ;is 
well.  ;is  ;ire  the  lull  ranjje  ol'st;ind;ird  VAX 
;md  Symbolics  peripherals. 
Languages: 

C:;:  supports  the  data  parallel  programming 
slylewhilemaldngmininKil  extensions  to 
the  C  language  itself.  ( )ne  new  parallel  data 
type  is  introduced. 

Lisp  is  supported  ai  two  levels.  CM-Lisp 
offers  fully  automatic  allocat  ion  o| 'parallel 
data  structures. :;: Lisp  allows  users  to  yet 
closer  I o  tin-  hardware.  Both  CM-Lisp  and 
:  :Lisp  are  extensions  of  Common  Lisp. 

Fortran  supports  the  data  parallel  pro- 
gramming style  by  using  I'brlran  77  with 
vector  and  control  extensions  mcelingilv 
emerging  standard  of  Fortran  Hx. 


Mass  Storage: 

I  )ata\aull  provides  highly  fault  tolerant 
storage  for  Conned  ion  Machine  data. 
I  JataVaulis  are  available  in  !">  or  K)  (Jbyles 
with  a  K  i  Mbyte  per  second  transfer  rate. 
I  !p  to  eight  I  )ataVaulls  may  be  used  in  par- 
allel, for  a  total  transfer  rate  ol':yo  Mbytes 
per  second.  Data  is  redundantly  stored 
across  multiple  disk  units  so  that  any  sin- 
tile  unit  can  fail  without  loss  of  informal  ion. 
(iraphics  Display: 

The  systems  high  resolution  graphic 
display  can  be  loaded  from  Connection 
Machine  memory  at  1  Gbil  per  second,  fast 
enough  for  real  time  animation.  The  dis- 
play stores :i  \2H0x  m\ color  in  ia<je.  with 
LM  bit  planes. 
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